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The development of a high-performance parallel system or application is an evolutionary process. It may 
begin with models or simulations, followed by an initial implementation of the program. The code is then 
incrementally modified to tune its performance ... 
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Extracting high-performance from the emerging Chip Multiprocessors (CMPs) requires that the applicatic 
be divided into multiple threads. Each thread executes on a separate core thereby increasing concurrenc 
and improving performance. As the number ... 
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Runtime monitoring support serves as a foundation for the important tasks of providing security, 
performing debugging, and improving performance of applications. Often runtime monitoring requires the 
maintenance of information associated with each of ... 
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In this paper a highly flexible and scaleable multiband impulse radio UWB architecture for high data rates 
is described and evaluated. The investigations are mainly focused on on-off-keying modulation combined 
with a low-complexity non-coherent energy ... 
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We present occlusion-switches for interactive visibility culling in complex 3D environments. An occlusion- 
switch consists of two GPUs (graphics processing units) and each GPU is used to either compute an 
occlusion representation or cull away primitives ... 
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The EECS 373 "Design of Microprocessor- based Systems" course at the University of Michigan ties 
hardware and software together by providing a modern platform on which students simultaneously 
develop both hardware and software components of simple systems. ... 
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Silicon technology will continue to provide an exponential increase in the availability of raw transistors. 
Effectively translating this resource into application performance, however, is an open challenge that 
conventional superscalar designs will not ... 
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This paper presents a new technique for global energy optimization through coordinated functional 
partitioning and speed selection for embedded processors interconnected by a high-speed serial bus. 
Many such serial interfaces are capable of operating ... 
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A brief history of the < u>goto</u> controversy (retention or deletion of the < u> goto< /u> statement) is 
presented. After considering some of the theoretical and practical aspects of the problem, a summary of 
arguments both for ... 
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A novel approach for testing embedded memories in complexsystems-on-a-chip (SOCs) is presented. The 
proposedsolution aims to balance the usage of the existing on-chipresources and dedicated design for 
test (DFT) hardwaresuch that the functional power ... 
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In this work, we propose a new FPGA design flow that combines the CUDA programming model from 
Nvidia with the state of the art high-level synthesis tool AutoPilot from AutoESL, to efficiently map the 
exposed parallelism in CUDA kernels onto reconfigurable ... 
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performance computing 
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We consider issues related to the reduction of scan test data in designs with multiple scan chains. We 
propose a metric that can be used to evaluate the effectiveness of procedures for reducing the scan data 
volume. The metric compares the achieved compression ... 
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A brief history of the goto controversy (retention or deletion of the goto statement) is presented. After 
considering some of the theoretical and practical aspects of the problem, a summary of arguments both ... 
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Single-assignment languages like SISAL offer parallelism at all levels — among arbitrary operations, 
conditionals, loop iterations, and function calls. All control and data dependencies are local, and can be 
easily determined from the program. Various ... 
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As more complex DSP algorithms are realized in practice, there is an increasing need for high-level 
stream abstractions that can be compiled without sacrificing efficiency. Toward this end, we present a set 
of aggressive optimizations that target linear ... 
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Normal basis and Montgomery multiplications are two popular arithmetic operations in GF(2 m ). In 
general, each element representation has its associated different algorithm and hardware multiplication 
architectures. In this paper, we will present ... 
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